Emad Barsoum

PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation

Apr 29, 2025 (via arXiv)

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Apr 13, 2025 (via arXiv)

DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models

Apr 12, 2025 (via arXiv)

MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM

Apr 03, 2025 (via arXiv)

AMD-Hummingbird: Towards an Efficient Text-to-Video Model

Mar 25, 2025 (via arXiv)

X-EcoMLA: Upcycling Pre-Trained Attention into MLA for Efficient and Extreme KV Compression

Mar 14, 2025 (via arXiv)

Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding

Mar 13, 2025 (via arXiv)

Týr-the-Pruner: Unlocking Accurate 50% Structural Pruning for LLMs via Global Sparsity Distribution Optimization

Mar 12, 2025 (via arXiv)

Partial Convolution Meets Visual Attention

Mar 05, 2025 (via arXiv)

Self-Taught Agentic Long Context Understanding

Feb 21, 2025 (via arXiv)